We address 2D floorplan reconstruction from 3D scans. Existing approaches typically employ heuristically designed multi-stage pipelines. Instead, we formulate floorplan reconstruction as a single-stage structured prediction task: find a variable-size set of polygons, which in turn are variable-length sequences of ordered vertices. To solve it we develop a novel Transformer architecture that generates polygons of multiple rooms in parallel, in a holistic manner without hand-crafted intermediate stages. The model features two-level queries for polygons and corners, and includes polygon matching to make the network end-to-end trainable. Our method achieves a new state-of-the-art for two challenging datasets, Structured3D and SceneCAD, along with significantly faster inference than previous methods. Moreover, it can readily be extended to predict additional information, i.e., semantic room types and architectural elements like doors and windows. Our code and models will be available at: https://github.com/ywyue/RoomFormer.
translated by 谷歌翻译
我们建议使用两层机器学习模型的部署来防止对抗性攻击。第一层确定数据是否被篡改,而第二层解决了域特异性问题。我们探索三组功能和三个数据集变体来训练机器学习模型。我们的结果表明,聚类算法实现了有希望的结果。特别是,我们认为通过将DBSCAN算法应用于图像和白色参考图像之间计算的结构化结构相似性指数测量方法获得了最佳结果。
translated by 谷歌翻译
2型糖尿病(T2DM)中持续的高水平血糖可能会带来灾难性的长期健康后果。 T2DM临床干预措施的重要组成部分是监测饮食摄入,以使血浆葡萄糖水平保持在可接受的范围内。然而,当前监测食物摄入的技术是时间密集的,容易出错。为了解决这个问题,我们正在开发使用连续葡萄糖监测器(CGM)自动监测食物摄入量和这些食物组成的技术。本文介绍了一项临床研究的结果,其中参与者佩戴CGM时,参与者消耗了9份标准营养素的标准餐(碳水化合物,蛋白质和脂肪)。我们构建了一个多任务神经网络,以估计CGM信号的大量营养素组成,并将其与基线线性回归进行了比较。最好的预测结果来自我们提出的神经网络,该神经网络受试者依赖性数据训练,如均方根相对误差和相关系数所衡量。这些发现表明,可以从CGM信号中估算大量营养素组成,从而开发了开发自动技术以跟踪食物摄入量的可能性。
translated by 谷歌翻译